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Abstract — The continuous-time, peak-limited, inflnite- 
bandwidtli Poisson channel with spurious counts is considered. 
It is shown that if the times at which the spurious counts occur 
are known noncausally to the transmitter but not to the receiver, 
then the capacity is equal to that of the Poisson channel with 
no spurious counts. Knowing the times at which the spurious 
counts occur only causally at the transmitter does not increase 
capacity. 

I. Introduction 

Communication aided by channel-state information (CSI) at 
the transmitter provides a rich source for interesting problems 
in information theory. A discrete memoryless channel (DMC) 
with independent and identically distributed (i.i.d.) states is 
described by a transition law 

Pt(Y ^y\X ^ x,S ^ s) = W{y\x,s) (la) 

and a state law 

Ps{s), (lb) 

where X denotes the channel input, Y denotes the channel 
output, and S denotes the state of the channel. 

Shannon [1] studied the case where the CSI is unknown to 
the receiver and known causally to the transmitter In this sce- 
nario, before sending x^, the transmitter knows {si, . . . , s^}. 
He showed that the capacity of the channel ^ is given by 

sup/(C/;y), (2) 

Pu 

where [/ is a random variable taking value in the set of 
mappings from channel states to channel inputs, and where 
Py\u is given by 

PY\uiy\u) = Y.Ps{s)Wiy\u{s),s). 

s 

A different scenario is where the CSI is still unknown to 
the receiver but known noncausally to the transmitter. In this 
case, the transmitter knows the whole state sequence before 
starting to transmit. The capacity of ([TJ in this case was found 
by Gel'fand and Pinsker [2] to be 

sup I{U;Y)-I(U;S), (3) 

U'^-(X,S)^-Y 

where the supremum is over all joint distributions of the form 

Pu{u)Px,s\u{x, s\u)W{y\x, s) 
whose marginal distribution on S is Ps{-)- 



The capacity with noncausal CSI was computed in various 
cases. The case of writing on binary memory with defects was 
solved by Kusnetsov and Tsybakov [3] before the discovery 
of the formula (O; the case of writing on binary memory with 
defects and noise was solved by Heegard and El Gamal [4]; 
and the case of additive Gaussian noise channel with additive 
states ("writing on dirty paper") was solved by Costa [5]. In 
these cases, capacity was found to be equal to the capacity 
when there are no states at all. 

In the present work, we consider the continuous-time, peak- 
limited Poisson channel with CSI at the transmitter. This 
channel model without states was studied in [6], [7], and [8]. 
In [9], Bross and Shamai considered the Poisson arbitrarily 
varying channel (AVC) with states. In their set-up the states 
correspond to a nonnegative signal that is added to the channel 
input. 

We consider a model where the channel states correspond to 
spurious counts at the receiver, and where the receiver does not 
know which counts are spurious. We study first the case where 
the CSI is known noncausally to the transmitter, and then the 
case where the CSI is known causally to the transmitter 

In the noncausal CSI case, we distinguish between two 
settings. In the first setting we assume that the states are chosen 
by an adversary subject to a constraint on the average number 
of spurious counts per second, and we allow the transmitter 
and the receiver to use random codes. Since the state sequence 
can be arbitrary, we cannot use Gel'fand and Pinsker's formula 
(O. Instead, as in [3] and [4], we show that the capacity 
with no spurious counts can be achieved on this channel 
with random codes by construction. In the second setting we 
assume that the spurious counts are random (not necessarily a 
homogeneous Poisson process). Using the result from the first 
setting, we show that the capacity with no spurious counts is 
achievable on this channel with deterministic codes. 

For the causal CSI setting, we show that, even if the spurious 
counts obey a homogeneous Poisson law, causal CSI does not 
increase the capacity of this channel. Thus, as in [3] and [4], 
in our channel causal CSI does not increase capacity at all, 
while noncausal CSI increases it to that of the same channel 
model but without states. 

The rest of this paper is arranged as follows: in Section 
|ll]we recall some important results regarding the peak-limited 
Poisson channel; in Section|lII]we state and prove the capacity 
results for the noncausal CSI case; and in Section |IV] we 
discuss the causal CSI case. 



II. Preliminary: the Poisson Channel 

In this section we recall some important results regarding 
the peak-limited Poisson channel. This channel's time-< input 
x{t) is a nonnegative real, and its time-t output y{t) is a 
nonnegative integer. For a given input signal x{t), t G M, the 
output Y{t), t G R is a Poisson process whose time-t intensity 
is x{t) + A, where A is a nonnegative constant called the dark 
current. 

We impose a peak-power constraint on the input so 



x{t) <A, te 



(4) 



where A > is the maximal allowed input power We denote 
the capacity of this channel (in bits per second) by C{A, A). 

The value of C {A, A) was first found by Kabanov [6] 
and Davis [7] using martingale techniques. Later, Wyner [8] 
showed that C {A, A) can be achieved by dividing the channel 
into small time-slots and then looking at the resulting DMC. 

To derive an achievability result, we follow Wyner and 
discretize this channel in time with every time-slot being A 
seconds long. We further restrict the input distribution so that 
within each time-slot the input is constant: either A or 0. 
The receiver front-end produces if there were no counts in 
the time-slot, and 1 if there were one or more counts. This 
discrete-time channel is memoryless and for small A its law 
can be approximated by 



W{l\x) 



(5) 



AA, x = 

(yl + A)A, x^l. 

We denote the capacity of the channel (|5]l (in bits per channel 
use) by Ca(A, A). Wyner [8] showed that 

^ ^ AiO A 



max 

pe(o,: 



C(A, A) = max {pA + X)D Ber , ^ ^ 
pe(o,i) V \ pA + \ 



{p(A + A) log(A + A) + (1 - p)\ log A 

- (pA + X) logip A + X)}. 
We note that C {A, A) can also be written as 

+ Be„p)Y 

(6) 

where Ber (tt) denotes the Bernoulli distribution of parame- 
ter TT. 

III. NONCAUSAL CSI 

We now consider the continuous-time Poisson channel as 
described in Section [III but with spurious counts occurring at 
the receiver. We assume that the times of these occurrences are 
known noncausally to the transmitter but not to the receiver 
We consider two settings: for Section IIII-AI we assume that 
the spurious counts are generated by a malicious adversary and 
that the transmitter and the receiver are allowed to use random 
codes; for Section IIII-BI we assume that the spurious counts 
occur randomly according to a law known to both transmitter 
and receiver and that only deterministic codes are allowed. 



A. Random Codes against Arbitrary States 

Consider the Poisson channel as described in Section |ll] 
but with an adversary who generates spurious counts at the 
receiver and reveals the times at which they occur to the trans- 
mitter before communication begins. These spurious counts 
can be modeled by a counting signal s( ), which is a nonneg- 
ative, integer-valued, monotonically increasing function. Thus, 
conditional on the input being x{-), the output is given by 



Y{t) = Z{t) + s{t), te 



(7) 



where Z{t) is a Poisson process whose time-t intensity is 
x{t) + X. We assume that the adversary is subject to the 
restriction that, within each transmission block, the average 
number of spurious counts per second cannot exceed a certain 
constant ly which is known to both transmitter and receiver. 

In a (r, R) (deterministic) code, the encoder maps the 
message M G {!,..., 2^-^} and the channel state s{t), 
t G [0,r], to an input signal x{t), t G [0,r], and the 
decoder guesses the message M from the channel output y{t), 
t G [0, r]. A (T, R) random code is a probability distribution 
on all deterministic (T, R) codes Q 

A rate R (in bits per second) is said to be achievable with 
random codes on the channel O if, for every T > 0, there 
exists a random (T, R) code such that, as T tends to infinity, 
the average probability of a guessing error tends to zero for all 
possible s(-). The random coding capacity of this channel is 
defined as the supremum over all rates achievable with random 
codes. 

Since the adversary can choose not to introduce any spu- 
rious counts, the random coding capacity of the channel ^ 
is upper-bounded by C{A, A), which is given in Q. Our first 
result is that this bound is tight: 

Theorem 1: For any positive A, X and ly, the random coding 
capacity of the channel (|7]), where s(-) is known noncausally 
to the transmitter but unknown to the receiver, is equal to 
C{A,X). 

Proof: We only need to prove the lower bound. Namely, 
we need to show that any rate below the right-hand side (RHS) 
of (|6]l is achievable with random codes on the channel (|7|. 
To this end, for fixed positive constants T, R and a, we 
shall construct a block coding scheme to transmit RT bits of 
information using (1 + a)T seconds. (Later we shall choose 
a arbitrarily small.) We divide the block into two phases, first 
the training phase and then the information phase, where the 
training phase is aT seconds long and the information phase 
is T seconds long. Within each phase we make the same 
discretization in time as in Section [III where every time-slot 
is A seconds long. We choose A to be small enough so that 
AA, AA and vA are all small compared to one. In this case, 
our channel is reduced to a DMC whose transition law can be 



'For more explicit formulations of random and deterministic codes, see 
[10] and references therein. 



approximated by: 



W{l\x,s) = 




A)A, 



X = 1, s = 
s = l. 



(8) 



Here x = 1 means that the continuous-time channel input in 
the time-slot is A, and x — means that the continuous-time 
channel input is zero; y — I means that at least one count 
is detected at the receiver in the time-slot, and y = means 
that no counts are detected; s = 1 means that there is at least 
one spurious count in the time-slot, so the output is stuck at 
1, and s = means that there are no spurious counts in the 
time-slot, so the channel is the same as (|5]l. From now on we 
shall refer to time-slots where s = 1 as "stuck slots." 

Denote the total number of stuck slots in the information 
phase by /iT. Then, by the state constraint. 



< (1 + a)v. 



(9) 



In the training phase the transmitter tells the receiver the value 
of /iT. To do this, the transmitter and the receiver use the 
channel as an AVC. Namely, in this phase the transmitter 
ignores his knowledge about the times of the spurious counts. 
Since, by the state constraint, the total number of stuck slots 
in the whole transmission block cannot exceed (1 + a)vT, 
we know that the total number of stuck slots in the training 
phase also cannot exceed (1 + a)vT. It can be easily verified 
using the formula for random coding capacity of the AVC 
with state constraints [11] that the random coding capacity 
of the AVC dHJ under this constraint is proportional to A for 
small A. Thus, the amount of information that can be reliably 
transmitted in the training phase is proportional to aT for large 
T and small A. On the other hand, according to (|9]l, we only 
need log((l + a)i>T^ bits to describe /iT. Thus we conclude 
that, for any a > 0, for large enough T and small enough A, 
the training phase can be accomplished successfully with high 
probability. 

We next describe the information phase (which is T seconds 
long). If the training phase is successful, then in the informa- 
tion phase, both the transmitter and the receiver know the total 
number of stuck slots /iT, but only the transmitter knows their 
positions. With such knowledge, they can use the following 
random coding scheme to transmit RT bits of information in 
this phase: 

• Codebook: Generate 2^^+^ codewords independently 
such that the symbols within every codeword are chosen 
i.i.d. Ber [p). Label the codewords as x(m, k) where m e 
{!,..., 2«^} and k € {1, . . . , 2^'^}. 

• Encoder: For a given message m e {1, . . . , 2^^} and a 
state sequence s, find a k such that 



T/A 



l{(x,(m, k), s,) = (1, 1)} > (1 - e)P^^^^,T., 



pA + \ 



where I{ } denotes the indicator function so the left-hand 
side (LHS) is the number of slots where x(m, k) and s 



are both one. Send x(m, k). If no such k can be found, 
send an arbitrary sequence. 

Decoder: Find a codeword x{in',k') in the codebook 
such that, for the observed y. 



T/A 

E 

3 = 1 



l{{x,im',k'),y,) = (1,1)} 



>il-ef-^{pA + X + p)T. 



pA + X 

Output to'. If no such codeword can be found, declare 
an error 

We next analyze the error probabihty of this random codebook. 

There are three types of errors which we analyze separately: 
• The encoder cannot find a k such that x(to, k) meets 
the requirement. We know that the total number of 
stuck slots in this phase is p,T. Since the codebook is 
generated independently of the stuck slots, we know that 
the symbols of a particular codeword at these slots are 
drawn i.i.d. according to Ber (p). By Sanov's theorem 
[12], [13], for large T, the probability that a particular 
codeword satisfies the requirement, i.e., has at least 
(1 — e)^^i^/iT ones in these /iT slots, is approximately 
2MTD(Ber((i-0^)||Ber(rt)^ Therefore, when T tends 
to infinity, the probability of this error tends to zero if 
we choose 



R' > P^dI Ber (1 - e 



pA + X 



Ber(p) . (10) 



There are less than (1 - e)S^^{pA + A + p.)T slots 
where the transmitted codeword and the output are both 
equal to one. The probability of this error tends to zero 
as T tends to infinity by the law of large numbers. 
There is some x' which is not the transmitted codeword 
such that there ai-e at least (1 - e)^^^±^(pyl + A + pi)T 
slots where x'j ~ 1 and yj = 1. To analyze the 
probability of this error, we first note that, by the law 
of large numbers, when T is large, the number of slots 
where yj = 1 is close to {pA + A + fi)T. We also 
note that a particular codeword that is not transmitted is 
drawn i.i.d. Ber (p) independently of y. Therefore, again 
by Sanov's theorem, the probability that this codeword 
has at least (1 - e)S^^{pA + A + /x)T ones at the 
approximately {pA + A + p)T slots where yj — 1 
is approximately 2(P^+^+^)™(B-((i-)^)||Ber(p)) ^ 
Thus, when T tends to infinity, this probability tends to 
zero if we choose 



R + R' < (pA + X + ^i)- 



D\ Ber (1 - e) 



A) 



pA + X 



Ber {p) 



(11) 



By combining ( fTOl i and ( fTTI ) and noting that e can be chosen 
to be arbitrarily small, we conclude that, for every p e (0, 1), 
when T is large and when A is small, successful transmission 



in the information phase can be achieved with high probabihty 
as long as 



R<{pA + \)D Ber 



p{A + \) 
pA + X 



Ber (p) 



(12) 



Furthermore, since we have shown that the training phase can 
be accomplished with any positive a, the overall transmission 
rate, which is equal to jq^, can also be made arbitrarily close 
to the RHS of ( fT2] i. Optimization over p implies that we can 
achieve all rates up to the RHS of (|6]l with random coding. ■ 

B. Deterministic Codes against Random States 

We next consider the case where, rather than being an 
arbitrary sequence chosen by an adversary, the spurious counts 
are random. Such random counts can be modeled by a random 
counting process S{-) which is independent of the message, 
so the channel output is given by 

Y{t) = Z{t) + Sit), teM, (13) 

where Z(-) is a Poisson process whose time-t intensity is 
x{t) + A, and conditional on x{-), Z{-) is independent of S{-). 
We assume that S'(O) = with probability one and 



lim — !— ^ < oo. 

t^oo t 



(14) 



Note that these conditions are satisfied, for example, when 
S{-) is a homogeneous Poisson process. We also assume that 
the law of S{-) is known to both transmitter and receiver (and, 
in particular, the code may depend on the law of S{-)), while 
the realization of S{-) is known noncausally to the transmitter 
but unknown to the receiver A rate is said to be achievable on 
this channel if, for every T > 0, there exists a deterministic 
(T, R) code such that the average probability of error averaged 
over S{-) tends to zero as T tends to infinity. The capacity of 
this channel is defined as the supremum over all achievable 
rates. 

Theorem 2: The capacity of the channel ( fTsT l, where the 
realization of S{-) is known noncausally to the transmitter but 
unknown to the receiver, is equal to C(A, A), irrespective of 
the law of S{-). 

Proof: We first observe that the capacity of ( fT3] l cannot 
exceed C(A, A). This is because we can mimic the channel 
(fT3T l over a channel without spurious counts as follows: The 
transmitter and the receiver use common randomness (which 
does not help on the single user channel without states) to 
generate S{-) and then the receiver ignores its realization. 

We shall next show that any rate below C{A, A) is achiev- 
able on O. Fix any e > and R < C{A, A). Let 

E[Sit)] 



C — lim 



t 



Since ^ < cxo, there exists a to such that 

E[5(t)]<2C, t>to. 
Using this and Markov's inequality we have 



Pr 



S{t) > ^ 



< e, t> to- 



Thus the error probability of a (T, R) random code where 
T > to can be bounded as 



Pr [error] < Pr 

< Pr S(T) > 
+ Pr 

< e + Pr 



error, S{T) > 



2C 



Pr 



error, S{T) < 



2C 



2C' 



S{T) < 
S{T) < 



2C 



Pr 



SiT) < 



2C 



2C 



T>to. 



(15) 



To bound the second term on the RHS of ( fTSl ) we use 
Theorem [T] which says that there exists ti such that, for any 
T > ti, there exists a random (T, R) code whose average error 
probability conditional on any realization of S{-) satisfying 
s(T) < ^ is not larger than e. Therefore, for such codes. 



Pr 



S{T) < 



2C 



<e, T>ti. 



(16) 



Combining ( fTSl ) and ( fTSI l yields that for any T > max{to, ^i} 
there exists a random (T, R) code for which 

Pr [error] < 2e. 

Since this is true for all e > and R < C{A, A), we 
conclude that all rates below C{A, A) are achievable on ( [13] ) 
with random codes. 

We next observe that we do not need to use random codes. 
Indeed, picking for every T and R the best deterministic 
(T, R) code yields at worst the same average error probability 
as that of any random (T, R) code. Thus we conclude that any 
rate below C {A, A) is achievable on ( fT3] l with deterministic 
codes, and hence the capacity of ST3[ is equal to C{A, A). ■ 

IV. Causal CSI 

In this section we shall argue that causal CSI does not 
increase the capacity of a peak-limited Poisson channel. We 
look at the simplest case where the spurious counts occur as a 
homogeneous Poisson process of intensity /i. We shall be im- 
precise regarding the definition of causality in continuous time 
by directly looking at the DMC (OH Since the continuous- 
time state S{-) is a Poisson process, the discrete-time state 
sequence S is i.i.d. with each component taking the value 1 
with probability /iA and taking the value otherwise. 

To argue that having causal CSI does not increase capacity, 
we shall show that every mapping u from channel states to 
input symbols (as in Q) is equivalent to a deterministic input 
symbol in the sense that it induces the same output distribution 
as the latter. Indeed, since when s = 1 the input symbol x 
has no influence on the output Y , we know that the value 
of does not influence the output distribution. Therefore 
u: s u{s) is equivalent to the mapping that maps both 
and 1 to w(0), and is thus also equivalent to the deterministic 
input symbol x = u(0). 

^For formulations of causality in continuous time see [14] or [15]. 



In a more explicit argument, we use the capacity expression 
(O. For any distribution Pjj on U, we let 

Px{x) = ^^(^)- 

u: u{0)—x 

Then, for the above Pu and Px, 

I{U;Y) ^ I{X;Y). 

Therefore we have 

sup/([/;y) < sup I{X;Y), 

Pu Px 

where the LHS is the capacity of the channel with causal CSI, 
and where the RHS is the capacity of the channel with no CSI. 
Thus we conclude that the capacity of our channel model with 
causal CSI is not larger than that of the channel with no CSI. 
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